Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente
Internal identifier: 001191 (Main/Exploration); previous: 001190; next: 001192
Authors: Klaus Lepsky [Germany]; John Vorhauer [Germany]
Source:
- ABI - Technik [0720-6763]; 2006.
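French descriptors
- Pascal (Inist): Logiciel libre, Traitement document, Traitement langage, Allemand, Indexation automatique, Recherche information, Analyse linguistique, Architecture système, Description système, Domaine d'application
English descriptors
- KwdEn: Automatic indexing, Document processing, German, Information retrieval, Language processing, Linguistic analysis, Open source software, System architecture, System description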
Abstract
Lingo is an open source software system for the automatic indexing of German-language documents. Its development was guided by flexibility, easy configuration, and suitability for different applications. The contribution discusses the advantages of a linguistically based automatic indexing system for improving information retrieval. Lingo's available linguistic functionality is presented and explained through examples. Stemming, the recognition and decomposition of compound words, lexical and algorithmic phrase recognition, and the correction of OCR errors are also covered. Lingo's open system architecture, its possible fields of application, and their limits are described.
Affiliations: Germany; North Rhine-Westphalia; District of Cologne; Cologne
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000382
- to stream PascalFrancis, to step Curation: 000404
- to stream PascalFrancis, to step Checkpoint: 000364
- to stream Main, to step Merge: 001225
- to stream Main, to step Curation: 001191
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="GER" level="a">Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente</title>
<author><name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Institut für Informations- wissenschaft Fachhochschule Köln Claudiusstrasse 1</s1>
<s2>50678 Köln</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
<affiliation wicri:level="3"><inist:fA14 i1="02"><s2>Gustavstrasse 6 50937 Köln</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">06-0316033</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 06-0316033 INIST</idno>
<idno type="RBID">Pascal:06-0316033</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000382</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000404</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000364</idno>
<idno type="wicri:doubleKey">0720-6763:2006:Von Klaus Lepsky E:lingo:ein:open</idno>
<idno type="wicri:Area/Main/Merge">001225</idno>
<idno type="wicri:Area/Main/Curation">001191</idno>
<idno type="wicri:Area/Main/Exploration">001191</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="GER" level="a">Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente</title>
<author><name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Institut für Informations- wissenschaft Fachhochschule Köln Claudiusstrasse 1</s1>
<s2>50678 Köln</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
<affiliation wicri:level="3"><inist:fA14 i1="02"><s2>Gustavstrasse 6 50937 Köln</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
<imprint><date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Automatic indexing</term>
<term>Document processing</term>
<term>German</term>
<term>Information retrieval</term>
<term>Language processing</term>
<term>Linguistic analysis</term>
<term>Open source software</term>
<term>System architecture</term>
<term>System description</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Logiciel libre</term>
<term>Traitement document</term>
<term>Traitement langage</term>
<term>Allemand</term>
<term>Indexation automatique</term>
<term>Recherche information</term>
<term>Analyse linguistique</term>
<term>Architecture système</term>
<term>Description système</term>
<term>Domaine d'application</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Lingo is an open source software system for the automatic indexing of German-language documents. Its development was guided by flexibility, easy configuration, and suitability for different applications. The contribution discusses the advantages of a linguistically based automatic indexing system for improving information retrieval. Lingo's available linguistic functionality is presented and explained through examples. Stemming, the recognition and decomposition of compound words, lexical and algorithmic phrase recognition, and the correction of OCR errors are also covered. Lingo's open system architecture, its possible fields of application, and their limits are described.</div>
</front>
</TEI>
<affiliations><list><country><li>Allemagne</li>
</country>
<region><li>District de Cologne</li>
<li>Rhénanie-du-Nord-Westphalie</li>
</region>
<settlement><li>Cologne</li>
</settlement>
</list>
<tree><country name="Allemagne"><region name="Rhénanie-du-Nord-Westphalie"><name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
</region>
<name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001191 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001191 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:06-0316033 |texte= Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente }}
This area was generated with Dilib version V0.6.32.